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Galaxy redshift surveys selected by neutral hydrogen using 
the Five-hundred metre Aperture Spherical Telescope 
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ABSTRACT 

We discuss the possibility of performing a substantial spectroscopic galaxy redshift 
survey selected via the 21cm emission from neutral hydrogen using the Five-hundred 
metre Aperture Spherical Telescope (FAST) to be built in China. We consider issues 
related to the estimation of the source counts and optimizations of the survey, and 
discuss the constraints on cosmological models that such a survey could provide. We 
find that a survey taking around 2 years could detect ~ 10 7 galaxies with an average 
redshift of ~ 0.15 making the survey complementary to those already carried out 
at optical wavelengths. These conservative estimates have used the z = HI mass 
function and have ignored the possibility of evolution. The results could be used to 
constrain T = fl nl h to 5% and the spectral index, n s , to 7% independent of CMB data. 
If we also use simulated power spectra from the Planck satellite, one can constrain w 
to be within 5% of -1. 
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1 INTRODUCTION 



Galaxy redshift surveys have played a significant role in con- 
straining the cosmological model, with the largest surveys 
to date having been performed by the 2dFGR^fand SDSS3 
teams using optical techniques. Low redshift samples have 
been used for some time to derive constraints on cosmolog- 
ical parameter combinations T and /b = fib/fim, where Q m 
and Qb are the total matter and baryon densities defined 
relative to critical, and h = _r7o/(100km s" 1 Mpc -1 ), as well 
as the spectral index of the density flu ct uations, n s , and neu- 
trino densities jTegmark et al.ll2004al lbl; IPercival et allbuOll ; 
ICole et al.ll2005l ). More recently the SDSS team have used 
a more sparsely sampled, but larger volume survey of lumi- 
nous red galaxies (LRGs) to measure the expected baryon 
acous tic peak on scales of ~ 100ft _1 Mpc jEisenstein et all 
120051 ). Since the acoustic scale is a standard ruler, this pro- 
vides a distance measurement allowing further constraints 
to be placed on cosmological parameters and, in particular, 
those which parametrize the properties of the dark energy. 

A large number of projects have been proposed recently 
to develop these ideas and to perform large photometric 



surveys in the optical (and possibly the infra-red) wave- 
band to find ~ 10 8 galaxies out to z ~ 1 (Dark Energy 
Survey; DES3), as well as de eper spectroscopic surveys t o 
find ~ 10 5 galaxies at z ~ 3 (|Blake and Glazebrookll2003l '). 
Such surveys will have much larger search volumes than 
those presently available, allowing for significantly more ac- 
curate measurements of the power spectrum and hence of 
the acoustic scale at a variety of redshifts. This could lead 
to very tight constraints on the properties of the dark en- 
ergy. 

Another promising technique is to use the 21cm 
emission from neutral hydrogen (HI) as a tracer of the 
gala xies allowing for a spectros copic galaxy redshift sur- 
vey (|Abdalla and Rawlingsll2004l ). This will require the de- 
velopment of telescopes with much higher survey speed, 
and hence a combination of larger collecting area and field- 
of-view (FoV), than those available at present. The cur- 
rent state-of-t he-art is a shallow all-sky surve y; the com- 
bined HIPASS (|Zwaan et al.ll2005l ) and HIJASS (|Lang et all 
I2003D survey, which has found ~ 10 4 galaxies. Fortunately, a 
world-wide technology development programme is underway 
with the ultimate aim of building a Square Kilometre Ar- 
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ray (SKAQ) with collecting area ~ 10 6 m 2 and a FoV much 
greater than ldeg 2 , with these specifications the SKA sur- 
veys may detect ~ 10 9 galaxies. 

Before the full SKA comes into operation (~ 2020) 
another large radio telescope will have been constructed 
which also has great potential for HI surveys. The Five- 
hundred-metre Aperture Spherical Telescope (FAST) will be 
an Arecibo-like teles cope with a significantly larger aperture 
(~ 500 m) and FoV l |NarJl2006l ). The aim of this paper is to 
discuss the scientific potential of a low-redshift HI survey of 
galaxies as a FAST key science programme. We will show 
that, if a focal plane array with 100 instantaneous beams 
can be developed, ~ 10 7 galaxies can be detected within a 
realistic survey period (~ 2 years) and that the low-redshift 
galaxy power spectrum can be measured much more accu- 
rately than is currently possible with optical surveys. Since 
hydrogen is the fundamental baryonic constituent of the uni- 
verse such a survey would provide an important view of the 
large-scale structure of the universe with different biases and 
be an important stepping stone to the larger HI galaxy sur- 
veys which will become possible with the SKA. 

In the next section we will briefly discuss some relevant 
observational parameters of FAST and then in subsequent 
sections, we will estimate the number counts, make fore- 
casts for errors on the power spectrum and constraints on 
cosmological parameters that could be possible with such a 
survey. 



2 FIVE-HUNDRED-METRE APERTURE 
SPHERICAL TELESCOPE (FAST) 

The FAST telescope is funded and when completed in 
around 2012-2013 it will be the largest single dish tele- 
scope in the wor ld. The expected specifications of the tele- 
scope (|NarJl2006l ) relevant to our discussion here are: 

• The prime-focus feed illuminates an area correspond- 
ing to a parabolic dish of diameter ~ 300m with aperture 
efficiency of 70% giving A eS = 50000m 2 , 

• At 21cm wavelength the beam width (FWHM) 0^3 
arcmin; the effective pixel (equating the volume of a cube of 
height unity to the integral under a gaussian beam with the 
same height) has side 1.0646 and area 1.1330 2 . 

• The maximum observable zenith angle is of order 40° 
allowing the observation of ~ 50% of the full sky without 
significant degradation of performance. Henceforth, the 'full 
sky' is defined to be the maximum amount of sky accessible 
to the telescope i.e. ~ 20000deg . 

• One of the principal receiver systems will be a focal 
plane array covering a frequency range around 1.4 GHz 
(the 21-cm HI line) which can produce ns instantaneous 
beams each with a system temperature T sys = 25K (hence 
A eS /T sys « 2000 m 2 A-" 1 per beam) and dual polarization 
capability. 

The expected thermal noise for a dual polarization single 
beam can be computed using 



for an observing time of t and a frequency bandwidth of Av, 
where k = 1380Jym 2 K _1 is the Boltzmann constant. If 
we assume a bandwidth of Av — 1 MHz which corresponds 
to a velocity linewidth of w 200 km s -1 at 21cm then the 
instantaneous sensitivity of each beam of the FAST system 
will be CTmst ~ 1 mjy s 1 ' 2 . 

The focal ratio of the telescope (f/D = 0.47) restricts 
7iB to ^ 19 for a conventional close-packed horn-based ar- 
ray, similar to that used for HIPASS and operated at the 
prime focus. Larger arrays require some of the horns to 
be far enough off-axis that aberration losses become unac- 
ceptable. A 19-beam array gives an instantaneous FoV of 
fiinst ~ 200 arcmin 2 . We will use this 19-beam system as 
the fiducial benchmark for our basic calculations. A receiver 
array with jib ~ 100 may well be possible. This require- 
ment could only be met with close-packed phased arrays of 
small antenna elements; these would not take up much more 
area than the 19-beam horn system. The required phased ar- 
ray technology is actively being developed by several groups 
working within the international Square Kilometre Array 
R&D efforiQ and hence we can be confident that by the 
time FAST comes into operation in 2012-3 the 100-beam 
receiver capability will be available. This would allow for 
f^inst ~ (ne/19) 200 arcmin 2 . Such a system will be at least 
twice as sensitive per beam as that on the Arecibo Telescope 
and have a survey speed (f2i n st(A e £f/T' S ya) 2 ) which is at least 
6(tib/19) times faster than the 7-beam system presently 
available there. FAST will also have more frequency flexi- 
bility due to the excellent RFI enviroment at the FAST site 
and will be able to observe twice the area of the sky than 
does the Arecibo Telescope. 



3 ESTIMATING GALAXY COUNTS 

3.1 Properties of the HI galaxy distribution 

The HI mass, Mm, of a galaxy at redshift z is given in terms 
of th e observed flux, S, and line width, AV , by (I Roberts! 
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km sec -1 



(->) 



Mm _ 2.35 x 10 s / d L (z) V f S_\ t 
M Q ~ 1 + z \Mpc J \3y) { 

where di J (z) is the luminosity distance to the galaxy. We note 
that this formula is usually quoted in terms of the Euclidean 
distance and without the (1+z) -1 ; this is the correct formula 
in an FRW universe. 

One can estimate the number of galaxies above some 
limiting mass Mii m (z) by computing 

dV 
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where the sky area covered is AQ and the size of the redshift 
bin is Az. dV/dzdQ, is the comoving volume element for 
the FRW universe and dN/dVdM is the comoving number 
density of galaxies per unit mass which we shall assume can 
be approximated by a Schechter function 
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We will, for the most part, assume no evolution in the HI 
mass function out to the moderate redshifts appropriate 
for a FAST survey and, therefore, use values of the pa- 
rameters derived from the m ost recent survey of the local 
universe (|Zwaan et alj|2005h . In particular, we will choose 
a = -1.37, 6* = 1.42 x 10" 2 /i 3 Mpc -3 and M m = 10 9 8 M Q . 

One can compute the average redshift of galaxies in the 
survey from N(M > Mu m ,z) by integrating appropriately 
over z, that is, 



100 



f~zN(M > M lim ,z)dz 
J °° N(M > My im , z)dz 



(5) 



In order to compute the limiting mass of the survey we 
will make a point source approximation, that is, we will as- 
sume that each galaxy detected will be smaller than the tele- 
scope beam, and hence, all the flux it emits will contribute 
to a single beam. This is likely to be a good approximation 
for most objects in the survey, but will break down for the 
closest, most massive objects. We expect those objects which 
violate this assumption to be the minority if we choose the 
survey strategy sensibly. 

There is an empirically derived relation between the HI 
diameter, Dm, which is defined to be the region inside which 
the HI surface density is greater than 1M© pc~ 2 , and the HI 
mass of a galaxy. This can be used to investigate the extent 
to which the point source approximation is valid. We will 
use (jBroeils and Rhedll997l ; IVerheiien and Sancisill200ll ) 



-Dm 
kpc 



10 6 - 8 M, 



(6) 



which has been converted to an angular scale using the an- 
gular diameter distance <1a(z) for the range of relevant HI 
masses in Fig.[T] We note immediately that at z m 0.02 mas- 
sive galaxies containing 10 10 M© of HI will be resolved by the 
FAST beam (see Fig. [1} but we postpone more detailed dis- 
cussion of the validity of the point source assumptions to 
later in section B. 

The beam area increases like A 2 oc (1 + z) 2 . As a re- 
sult if an optimal survey strategy, in which one uniformly 
tiles the z = slice, is chosen then slices of the survey at 
higher redshift receive extra exposure due to the fact that 
thes e slices will overlap. The effe ct has been considered in 
ref. (|Abdalla and Rawlingsl 120041 ) where it was shown that 
the flux limit relevant to a particular redshift slice is reduced 
by (1 + z) -1 . Hence, the flux limit for an observation, Sum, 
for a specific signal-to-noise ratio {S/N) is given by 



Sum = (S/N) 



1 + z 



(7) 



A given object is detectable if its flux density in bandwidth 
Av — i/AVo/c is greater than 5n m . 



3.2 Simple estimates assuming a fixed observing 
bandwidth 

A simple assumption, which should yield an order of magni- 
tude estimate of the number of objects that one might expect 
to find in a survey, is that the line width of all the galax- 
ies is AV = 200 km s -1 . Under this assumption we have 
computed Mu m (z) and the number of galaxies one would 
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Figure 1. The angular sizes (in arc minutes) for galaxies with HI 
masses 10 10 M Q , dotted, 10 9 M Q , dashed, and 1O 8 M , dot-dash. 
The (approximately) horizontal large-dashed line represents the 
FWHM beamsize of FAST which scales as A oc 1 + z. 
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Table 1. Predicted number counts with S/N = 4 in one day of 
observation using a receiver system with rtB = 19 for a range of 
observation strategies. Counts expected for longer observations, 
or for larger numbers of beams, can be computed using a simple 
linear scaling. The limiting fluxes are computed using Eqn.[7]with 
z = 0. All numbers have been rounded to 2 significant figures. 



expect to find in an average day of observation as a function 
of tobs, the amount of time spent observing each beam area. 
Allowing for calibration and telescope maintainance, we will 
use an on-source integration time of 18 hours to represent 
one day. The mass limit and number counts as a function of 
redshift (with bin width Az = 0.01) are presented in Fig. [5] 
for S/N = 4 for iib = 19. The probability of a spurious de- 
tection will be ~ 6 x 10 -5 , assuming a Gaussian distribution 
for the noise. Since an all sky survey will contain ~4x 10 9 
independent volume elements (the number of independent 
beam areas multiplied by the number of indepndent depth 
slices) this would mean ~ 2 x 10 5 spurious detections. In 
such a survey, therefore, there will be around 1% spurious 
detections (we will find that around 10 7 galaxies can be de- 
tected). This will not significantly affect the measurement 
of the power spectrum discussed in the later sections of this 
paper. 
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Figure 2. a) the S/7V = 4 limiting mass for different integration 
times (in seconds) per pointing: i obs = 6 (dotted), 60 (short- 
dash), 600 (long-dash), 6000 (dot-short dash), 60000 (dot- long 
dash), b) the predicted number counts in a day of observation 
(defined as 18 hours of on-source integration time) for each of the 
observation strategies for «b = 19. The bin width is Az = 0.01. 
The total number of galaxies predicted in each case is summa- 
rized in Table [T] The solid line (that is, the lowest) in panel a) 
represents the estimated HI mass in a typical volume enclosed 
by the beam area and the fiducial velocity width 200 km s -1 . If 
this line is greater than a substantive fraction (say, 1/3) of the 
mass limit one would expect a confusion related increase in the 
observed noise. 



One can immediately scale the number counts discussed 
in the previous paragraph to longer integration times and 
larger numbers of beams. If t aurvC y is the total time of the 
survey and the focal plane array has tib beams then the 
number of galaxies detected at a particular signal-to-noise 
ratio is given by 

;)(-'• 



N(UB, tsurvoy) 



TIB 

19. 



Id 



N(n B = 19, t = 1 d) , (8) 



although one has to be careful to make sure that t aurvoy < 
t s ky, the time taken to cover the entire sky available to 
FAST, which is given in terms of tobs by 



(9) 



that is, the full 20000 deg will be covered in 5.5 days for 
a system with 19 beams if one spent 1 s per pointing. We 
remind the reader that we have assumed that 1 day corre- 
sponds to 18 hours of on-source integration time. 

Under the assumptions of this calculation, we deduce, 
for example, that a survey of the whole sky available to 
FAST with a noise level of cr no i sc w 40 /iJy per beam is pos- 
sible by performing 600 s integrations in around 9 years if 
riB = 19, but that this would take less than 2 years with 
7iB ~ 100. Such a survey would find around 7 x 10 6 galaxies 
with S/N = 4 with an average redshift (z) w 0.15. In sub- 
sequent sections, we will improve the validity of the calcula- 
tion by modelling additional important effects. However, we 
shall see that this simple calculation holds true as a first or- 
der estimate. It is already clear that, if such a cosmologically 
important redshift survey is to be completed in an accept- 
able amount of time, a focal plane array with «b ~ 100 is 
essential. 

We can check a posteriori that the point source approxi- 
mation that we have made is valid by examining the limiting 
mass and expected number counts at very low redshift. We 
note first that the number counts tend to zero as z tends 
to zero since the volume element is oc z 2 . Thus, even in the 
case of tobs = 6 s, there are very few objects with z < 0.02, 
the point at which Mm = 10 10 M© galaxies are resolved by 
FAST. The percentage of low redshift objects will decrease 
as tobs increases. We estimate that the percentage of galax- 
ies that would be significantly resolved for t b s = 6 s is less 
than 10% and for t b s > 60 s the effect will be negligible. 

In the most extreme circumstances, one might also be 
sensitive to the effects of confusion even in a spectroscopic 
survey since the FAST bcamsize is likely to be much larger 
than the size of the galaxies we are trying to detect. One 
can attempt to quantify the expected amount of neutral 
hydrogen in volume of the universe enclosed by a Gaussian 
beam with area !Qb = 7rr?| WHM /(4 log 2) and the fiducial 
linewidth AV = 200 km s -1 . This is given by 

(Af HI )=p H :(,)(l + ^B^, ( 1Q ) 

where conservatively we assume that pm(z) = pHi(0)(l + 
z) 3 . We take the local HI density t o be n H i = 2 6 x lO " 4 ^" 1 
as found in the HIPASS survey (|Zwaan et alj|2005h . that 
is, jOHi(0) w 7.2 x 10 7 hM Mpc" 3 . The computed value of 
(Mm){z) is compared to the mass limits for various values 
of £ obs in Fig. [21 It is clear from this that the effects of 
confusion are only significant for large values of i b s ; one 
might imagine that if {Mm} > Mn m /3 then confusion could 
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lead to an increase in the effective noise making it more 
difficult to find galaxies. 

From the above discussion it seems sensible to consider 
values of 60 s < i D b s < 6000 s, in order to avoid any possible 
corrections to the statistics due to resolving galaxies at low 
z and confusion at high z. 

3.3 Mass and inclination dependent effects in the 
linewidth 

In this section we attempt to improve our calculation of 
the number counts by taking into account the fact that not 
all galaxies have the same observed linewidth AV . This 
can happen for two reasons: first the intrinsic linewidth is 
expected to depend on the mass for virialized systems, and 
secondly not all galaxies will be observed edge-on. If they are 
observed face-on, then it is the motion of HI perpendicular 
to the disk which sets the linewidth and not rotation. 

The intrinsic linewidth of a galaxy, corrected for broad- 
ening, ha s been shown empirically to be related to the HI 
mass by (|Briggs and Raoll 19931 ; lLang et al.ll2003h 



AV« 



( M B 



420 kms- 



\10 10 M Q 



(11) 



although we note that this relation shows a large dispersion, 
especially for dwarf galaxies. 

The observed linewidth of a galaxy, AVe, which sub- 
tends an angle 8 between its spin axis and the line-of- 
sight ca n be computed using the Tully-Fouque rotation 
scheme (|Tullv and Fouaueil 19851 ) 

(AV c sin(<9)) 2 = 



(AV e f + (AV t ) 
2AV e AV t | 1 - e 



-2(AK) V 



' AV e N 



(12) 



AV = 120 km s -1 represents an intermediate transition be- 
tween the small galaxies with Gaussian HI profiles in which 
the velocity contributions add quadratically and giant galax- 
ies with a 'boxy' profile reproduced by the linear addition 
of the velocity terms. AVt ~ 20kms~ 1 is the velocity width 
due to random motions in the d isk |Rhee and van Albadal 
ll996l ; IVerheiien and Sancisill200ll ). 

With this definition of 8, 8 = corresponds to face- 
on and 8 = n/2 to edge-on. In cases where AVe » AV C , 
one can see that AVg = AVt + AV c sin#. For 8 = 0, one 
finds that AVg = AVt whereas for 8 = n/2 one finds that 
AVe = AV t + AVo as expected. 

In addition there is a broadening effect, AVinst, of the 
HI profile due to the frequency resolution of the instrument, 
R. For a range of gal axy profiles, this broa dening is found to 
be AVinst « 0.557? l|Bottinelli et al.lll990i ). An appropriate 
value for the present discussion is AVmst ~ 16 km s~ . It has 
been shown (|Lang et alj|2003h that AVinst should be added 
linearly to AVe to give the effective observed linewidth, 

AV (#) = AVe + AVinst • (13) 

AVe can be computed by using the Newton-Raphson 
method to find the root of Eqn. [12] for a given AV c (Mm)- 
The effective observed linewidth is illustrated in Fig. as a 
function of AV C for an edge on galaxy and as function of 8 
for AV e = 420 km s" 1 . 
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Figure 3. The effective observed linewidth AV as a function of 
a) AV e for 9 = w/2, that is edge on; b) 8 for AV C = 420 kms -1 . 



The limiting mass will now depend on 8 as well as z, 
Mn m (z, 8), since it will be easier to find objects which have 
8 — 0. In order to incorporate this we have take into account 
the probability of a given angle 8, V{8), when computing the 
number counts 



N (M > Ml im ) 

where 
dN 



r/2 



d8V{8) 



dN 
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^„ dV f°° dN 

— = AD.Az / 

d8 dzdQ, J B) dVdM 



dM. 



(14) 



(15) 



As an illustration, we have plotted dN/d8 against cos# at 
z = 0.1 with Az = 0.01 for t obs = 600 s and S/N = 4 
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Figure 4. The effect of the angle of inclination on the detectabil- 
ity of galaxies. We have used z = 0.1, Az = 0.01, t b s = 600 s 
and S/N = 4. 



in Fig. |4] which shows that objects with 9 close to zero are 
preferentially selected. 

A simple assumption would be that the galaxies are ran- 
domly distributed on the celestial sphere and that the spin 
axes are also randomly distributed. This is not the case for 
satallite galaxies around their central galaxy, where a small 
anisotropy of the order of a few percent has been observed 
in SDSS data (|Yang et al.ll2006t ). but it is likely to be a good 
approximation for the FAST surveys discussed here which 
will preferentially select large, well-separated galaxies with 
average separation w 5Mpc. In this case V(&) = sin 8 and 
therefore one can think of the value of cos 9 being uniformly 
distributed. Hence, one can deduce that 



Table 2. The equivalent of Table \T\ when the effects of mass and 
inclination on the observed linewidth are taken into account. The 
value of Sij m is dependent both on the mass and angle of inclina- 
tion in this case, we have, therefore, not attempted to include it. 
All numbers have been rounded to two significant figures. 



forward is that the amount of HI is likely to increase as we 
go back in cosmic time and that the estimate of number 
counts based on the HI mass function measured at z = 
is likely to be a lower limit of the number counts found by 
a particular survey. We will make a simple assumption that 
the relevant galaxies are formed by the epochs probed by 
FAST, and that HI is being used as the fuel for star for- 
mation, and hence is reducing in each of the galaxies as a 
function of time. We note that this passive star formation 
assumption ignores the possibility of mergers. We further 
note that this is only a toy model and is of course far too 
simplistic to be considered a realistic portrayal of the intri- 
cate physical processes acting on the HI mass function. It is 
designed solely as an illustrative example. 

If the HI mass function is well-represented by a 
Schechter function then the total density of HI is given by 



Phi = r(a + 2)6»* A/hi 



(17) 



If we now make the assumption that the faint-end slope 
a does not evolve with redshift, and that 9* 7^ 9*( z ), we 
can model the effective change in the hydrogen mass due 
to passive star formation as a net shift of the break M HI = 
A/ HI (z) with redshift. We assume that a simple form of this 
evolution 



(16)M H i(«) = Mhi(0)(1 + /3 2 



(18) 



The results of taking account these effects are presented 
in Tableland Fig. [S] It is clear that the number counts are 
slightly larger, by around 30%, than under the assumption 
that AT4 = 200 km s -1 . The scaling formulae discussed in 
section 13.21 apply again here. For a survey with na « 100 
taking around 2 years with tobs = 600 sees one would be 
able to cover all the sky available to FAST and find ~ 10 7 
galaxies. 



3.4 Effects of evolution 

The calculations of the previous section are based on the 
assumption that the HI mass function does not evolve. In 
this section we discuss the possible effects of evolution on 
the number counts. It should be noted that none of these 
evolving models are used in the cosmological parameter con- 
straints in Section [4] The basic picture which we will put 



where 8 is some constant of proportionality that determines 
the rate at which the hydrogen mass is consumed by star 
formation. This should be valid at low redshifts. In the sub- 
sequent discussion we will assume that ^ j3 ^ 3 in this 
model. 

One can determine values compatible with present 
knowledge of the evolution of fim- Under the assumption 
of Eqn. [TS]the total neutral hydrogen density of Eqn. 1171 is 
given by 

pm = I\a + 2)6>*M HI (0)(1 + Bz) = pm{z = 0)(1 + Bz) . (19) 

Using the cosmic density of neutral gas fi g (z) (measured 
relative to the present day critical density) one can estimate 
the value of (5 by assuming that the fraction of HI and He 
is constant between z — and 2 = 1. By taking the ratio of 
measured neutral gas cosmic densities in the local Universe, 
f2 g (2 ~ 0) = 3.5 x 10~ 4 , and from damped Lyman-a systems 
at hig her-redshift (|Prochaska et al] .2005: Ra o and Turnshekl 
|2000| ). fi g (z w 1) = l.Ox 10" 3 , we ca n estimate 3 ~ 2 which 
is also compatible with other works (|Lah et al.ll2007l ). 
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Figure 5. The equivalent of Fig.[2]for the case of FAST with 100 
beams when the effects of mass and inclination on the observed 
lincwidth are taken into account. 



Figure 6. The predicted daily number counts with S/N = 4, 
Az = 0.01 binwidth, for the FAST operating with n B = 100 
and t t, s = 600 s as a function of redshift with a simple rcdshift- 
cvolving mass break in the Schechter function of Eqn. [4] This 
is shown as a factor /3 = [0, 1, 2,3] for the solid, dot, dash, dot- 
dashed lines respectively. 



as being of legitimate scientific interest in its own right such 
a survey would allow one to investigate the optimal depth of 
the main redshift survey and pin-down some of the questions 
raised in this section. In Fig. [7] we show the redshift distri- 
bution of galaxies that one would expect for an HI survey 
with fobs = 6000s lasting 30 d using the z — mass func- 
tion and bins Az = 0.1. One would expected to find around 
42000 galaxies with (2) w 0.3 using n B = 19 and 222000 
for 71b = 100. In order to estimate accurately the parame- 
ters of the mass function, we estimate that one would require 
~ 1000 galaxies per bin. Therefore, we believe that it should 
be possible to determine the HI mass function out to z ~ 0.6 
for tib = 19 and to z ~ 1 tib = 100 with such a month-long 
survey. 



The effects of including this mass evolution for are 
shown in Fig.[S]for the 100-beam FAST with t obs = 600 s. It 
is clear that evolution leads to an increase in the number of 
galaxies found per day which are 14420, 19070, 25190, 32780 
for j3 = 0, 1, 2, 3 and the median redshift increases with f3. 
Essentially, we have show that one can take the current pre- 
diction of ~ 10 7 galaxies found by the 100-beam FAST in 2 
years of observation is a conservative lower bound. If we take 
seriously the value of j3 — 2 then one would expect to find 
nearly twice as many galaxies as in the case of no evolution. 

Prior to embarking on the full, all-sky galaxy redshift 
survey, it would be sensible to perform an exploratory, 
deeper survey of a much smaller area, with the objective 
of measuring the evolution of the HI mass function. As well 



4 EXPECTED ERRORS ON THE MATTER 
POWER SPECTRUM 

We now use the predicted galaxy number counts, assuming 
no evolution of the kind described in the last section, to 
estimate the errors of the galaxy power spectrum at z = 
(z) w 0.15. P(k, z) is related to the power spectrum P(k, 0) 
by 

P{k,z) = [D(z)] 2 P(k), (20) 
where D(z) is the growth factor computed from 



D(z) = ^E{z) 



(l + z')dz' 



(21) 



an E(z) = H(z)/H . 

Errors on the power spectrum are due to two factors: 
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Figure 7. The number of galaxies that would be found in a survey 
lasting 30d with t b s = 6000 s. In the case of rip, = 19 one expect 
to find around 42000 galaxies with (z) 0.3 in 17deg 2 , whereas 
one might find as many as 222000 in 87deg 2 for rip = 100. We 



have used Az = 0.1 and the . 



HI mass function in both cases. 



sample variance, i.e. the fact that not all k modes are mea- 
sured, and shot-noise which is the effective noise on the mea- 
surement of an individual mode. The total error op on the 
measurement of the power spectrum, P(k,z), for a given 
k with logarithmic bin width A(log k ) can be expressed 
as ijFeldman et al.lll994l : lTeemarklll997h 



(¥)' 



l 



(2-) 3 



( 1 + nP y 
\ nP ) 



(22) 



47rfc 3 A(logfc) Kff(fc) V nP 

where P = P(k, z) and n = n(z) is the number density of 
galaxies which are detected (making nP dimensionless) 

dN 



n(z) 



dVdM 



dM . 



(23) 



and Vcs(k) is the effective survey volume probed for a par- 
ticular fc-mode 

2 



Veff(fc) = Afi 



Jo U + nJ 



dV 
dzdfl 



{z)dz . 



(24) 



For definiteness, we will consider a survey taking two 
years performed by the FAST telescope with a focal plane 
array with rip — 100 and t a ^ B = 600 s per instantaneous 
FoV, fijnst ~ lOOOarcmin 2 . The choice of £ ODS = 600 s is 
simply the required survey time for scanning the full sky in 
2 years and hence is the most efficient use of the telescopes 
resources. The justification for such an observation strategy 
is the simple fact that the area covered is inversely linear 
in integration time per field while the depth attained goes 
as the square root of the integration time; the most cos- 
mic volume is attained from full sky surveys. We have also 
calculated the effect of probing different baryonic acoustic 
modes in the sky with varying observation strategies and de- 
termined that the simple maxim of 'wide-shallow' is always 
preferable to 'narrow-deep' in large-scale galaxy surveys. Us- 



ing the methods of section 1331 we have estimated that such 
a survey will find ~ 10 7 galaxies with S/N — 4, the vast 
majority of which are spatially unresolved. 

The expected errors on the power spectrum are pre- 
sented in Fig. [8] for A(logfc) = 0.04, which should be suffi- 
ciently large to ensure the errors are uncorrelated, ac c ordin g 
to a simple prescription detailed in iMeiskin et al.l (|l999h . 
The errors increase at low k due to sample variance and 
would also increase at high k due to shot noise were it not 
for our choice of a logarithmic binning; that is, the bins do 
not contain an equal number of fc-modes. We have also di- 
vided the power spectrum by a model with no baryons which 
illustrates the acoustic features in the spectrum. The average 
redshift (z) w 0.15 will prevent much cosmological informa- 
tion being gleaned from the acoustic scale and therefore we 
will not use it explicitly in our estimates of the cosmological 
parameters. It is clear, however, from Fig. [8] that the detec- 
tion of the baryonic features would be possible with these 
data. 

It is now interesting to compare the characteristics of 
the proposed FAST survey with those presently available. 
The effective volume of the proposed FAST survey evaluated 
at the k — k\ = 0.075ft Mpc - scale of the acoustic peak in 
the power spectrum, is V c s = 0.82 x 10 9 /i -3 Mpc 3 . This is 
shown in Fig. [9] where the effective volume probed, as well 
as the galaxy number count, of a variety of optical survey 
schemes are given, namely the SDSS main and SDSS LRG 
results as well as 2dFGRS, in comparison with FAST iib = 
19 for both i b s = 120 s and fobs = 600 s survey modes as 
well as our proposed version with np = 100 and t ohB = 600 s. 



ERROR FORECASTS FOR COSMOLOGICAL 
PARAMETERS 



In the previous section we have shown that a survey us- 
ing the FAST with n B = 100 would find w 10 7 galaxies. 
It is interesting to determine the improvement that this 
would bring to cosmological parameter estimation through 
the accurate sampling of the galactic power spectrum rel- 
ative to current large-scale optical surveys; 2dFGRS and 
SDSS. Since the FAST survey will have similar redshift cov- 
erage, but in a different waveband, this would bring more 
than just an improvement in signal-to-noise. 

For the 2dFGR S survey we use the data presented 
in l|Cole et al.l 120051 ). We select 32 bandpowers in the 
range 0.022 < k/h (Mpc" 1 ) < 0.147, where non- linear ef- 
fects aje_e2cpjMted_to be small in this region. Even so, in 
ref. (|Cole et al.ll2005h it was proposed to use a fitting func- 
tion of the form 



(25) 



P obs (k)=b 2 ±±^-P lin (k), 

to model the residual non-linear effects on the original, lin- 
ear, power spectrum Pu n (fc). For the 2dFGRS data, the 
authors advocate using the parameters A — 1.4 Mpc -1 
and Q = 4 Mpc -2 . We follow this treatment, and analyt- 
ically marginalize over the bias parameter b in the subse- 
quent Markov-Chain Monte-Carlo (MCMC ) analysis. For 
the SD SS data, we use data presented in ref. (|Tegmark et al.l 
l2004al ). using 19 bandpowers in the range 0.016 < 
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Parameter 


SDSS 


2dFGRS 


FAST (19/120) 


FAST (19/600) 


FAST (100/600) 


8Cl m /Q m 


0.416 


0.221 


0.230 


0.310 


0.078 


sr/r 


0.328 


0.161 


0.166 


0.231 


0.049 


Sfb/fb 


0.378 


0.253 


0.468 


0.486 


0.283 


Sn s /n s 


0.251 


0.141 


0.148 


0.178 


0.071 



Table 3. Summary of parameter values which come from the SDSS main sample and 2dFGRS, and the simulated constraints expected 
from the proposed FAST surveys with : (i) n B = 100, t b s = 600 s; (ii) n B = 19, t b s = 120 s; (i) n B = 19, t D b a = 600 s. Values presented 
arc the fractional errors for each parameter. 




k [h Mpc-'] 




k [h Mpc-'] 

Figure 8. a) the expected errors on the power spectrum for the 
proposed FAST survey; b) the same errors, but on a scale where 
the power spectrum has been divided by the equivalent no baryon 
model. 
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Figure 9. The effective V e ff at k = 0.075/fMpc -1 for various 
completed and proposed surveys plotted against the approximate 
number of galaxies they have or might find. The volumes are 
quoted in terms of that of the SDSS main galaxy sample which 
has Vsdss = 1-5 x 10 8 /i" 3 Mpc 3 . The open square is for the SDSS 
main sample, the cross is for the SDSS LRG sample, the hexagon 
is for the 2dFGRS, the pentagon is FAST with n B = 19 with 
*obs = 600 s while the circle is with i t, s = 120 s and the open 
triangle is for the proposed FAST survey taking 2 years with 
n B = 100. 



fc/(/iMpc _1 ) < 0.205. We take the power spectrum of galax- 
ies to be of the form P b s (fc) = b 2 Py ln (k) , and analyti- 
cally marginalize over the bias. For each of these datasets 
we u se the window functio ns provided by (|Cole et al.ll2005h 
and (|Tegmark et al.|[2004al ) respectively. 

For the FAST data, we generate d a fiducial power 
spectrum using the CAMB software (|Lewis et al.l |2000| ) 
using cosmological param eters that best-fit the ACDM 
model (jSpergel et al.ll200a ), in this case [D. c h 2 , Ob/i 2 , h, n S) 
A s ] are given by [0.104, 0.0223, 0.734, 0.951, 2.02 x 10" 9 ], 
where A s is the initial amplitude of fluctuations. We then 
sample Pu n {k) at closely spaced values of fc/h, from which 
we interpolate to obtain Pn n (fc) at the appropriate FAST 
survey fc/h values, along with the simulated error bars. 

For completeness we have investigated three FAST sur- 
vey schemes, namely the fiducial 19-beam FAST, with in- 
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tegration times per pixel of t a ^ B = 120 s and 600 sees, 
and the 100-beam system with 600 seconds of integration 
per pixel each of which would take ~ 2 years to com- 
plete. For each scheme we use 28 bandpowers in the range 
0.005 < fc/(hMpc _1 ) < 0.155, and take the galaxy power 
spectrum to have the form P b s (fc) = b 2 Py ln (k) . We also as- 
sume that the errors on each of the bins are uncorrelated 
due to the large binwidths taken in k as mentioned in Sec- 
tion U The 19-beam survey with f Q b s = 120 s, which could 
find w 3 x 10 6 , galaxies would cover the full 20000 deg 2 , al- 
beit to a shallower depth than the 100-beam case, whereas 
the 19-beam i D b s = 600 s survey would only cover 4000 deg 2 
and find w 2 x 10 6 galaxies. 

For the MCMC analysis we use 

C0SM0MC (|Lewis and Bridle! 120021 ) to create chains to 
estimate the confidence limits on the cosmological param- 
eters. Since large parameter degeneracies exist when only 
including large-scale structure data in the fit, we impose 
the consistency relation that the angle ^acoustic subtended 
by the first acoustic peak is 1.04 0, which is strongl y 
constrained by the WMAP3 data (|Spergel et all l2006h . 
This essentially leaves three remaining free parameters in 
the fit: Q c h 2 ; flbh 2 and n s , which are used to compute the 
parameter V = Q^h and /b = fib/f^m- This is because we 
marginalize over bias, which absorbs the initial amplitude 
of fluctuations, and h can then be derived from these 
parameters and # acous tic. 

In Table [3] we present the fractional error on each of 
the marginalized cosmological parameters for the 2dFGRS 
and SDSS datasets, along with the three FAST observing 
schemes. The errorbars on /b are of a similarly poor level 
to those possible with 2dFGRS and SDSS. This is probably 
since dependence of the power spectrum on /b is weak and 
the present surveys have reached the ceiling on how well this 
can be measured from the matter power spectrum. However, 
the errors on F and n s are significantly improved for the case 
of 71b = 100 and t a b s = 600 s, the area of the error ellipse 
in the F — n s direction is reduced by about a factor of six 
from that possible with 2dFGRS. It is clear, however, that 
the FAST surveys with tib = 19 do not significantly improve 
on the constraints already available. Of the two, that with 
tabs = 120s which covers the whole sky does much better 
than t bs = 600 s making it clear, yet again, that surveys 
which find the largest number of objects (and which cover 
all the available sky) will typically constrain cosmological 
parameters the best. 

In Fig. [10] we present marginalized 2D likelihoods of F 
versus /b . Here, we impose a pri or of n a = 0.95 ± .02, as 
preferred by the WMAP3 data (jSpergel et alj l200rj ) . The 
first thing that is apparent is that the SDSS and 2dFGRS 
do not agree on the central value; something which has been 
noted in the literature and is thought to be asso ciated with 
the d ifferent selection criteria for the two surveys (|Cole et al.l 
l2006h . What is also evident is that FAST with n B = 100 per- 
forms considerably better than the two presently available 
surveys. 

Low redshift galaxy redshift surveys cannot constrain 
the properties of dark energy directly, but they can in com- 
bination with measurements of the CMB. Essentially, the 
measurement of F by the redshift survey breaks the angular 
diameter degeneracy, allowing simultaneous measurements 
of h and the equation-of-state parameter of the dark energy 



Parameter 


Planck 


Planck + FAST 100/600 


Vl b h 2 


0.0223 ±0.0002 


0.0223 ±0.0002 


n c h 2 


0.104 ±0.002 


0.104 ±0.002 




0.952 ± 0.005 


0.952 ± 0.005 


log(10 10 A s ) 


3.01 ±0.01 


3.01 ±0.01 


h 


0.751 ±0.131 


0.731 ±0.015 


T 


0.090 ± 0.006 


0.090 ± 0.0006 


W 


-1.02 ±0.328 


-0.99 ±0.05 



Table 4. Expected errors for a 7 parameter fit using Planck alone 
and Planck + FAST 100/600. There is a degeneracy between w 
and h for Planck only, which is broken by the inclusion of FAST, r 
is the optical depth to rcionization, asssumcd to be instantaneous, 
and A s is the standard power spectrum amplitude. 



w = Pj p. In order to see how FAST could impact on this 
we have performed a joint analysis using a simulated Planck 
dataset using a noise power spectrum for the temperature 
anisotropies out to ^ max = 2000 of Nj T = 9 x 10~ 5 /iK 2 , and 
that for the E-mode polarization given byAf E = 4Nj T . We 
have assumed full sky observations and that optimal un- 
correlated errors can be reconstructed from the data. The 
results are presented in table [4] which show that w can be 
measured to within 5% of -1, and h to within w 2%. 



6 COMPARISON WITH EARLY-PHASE SKA 

An early phase of the SKA with, say, 10% of the total collect- 
ing area (that is, an effective collecting area of ~ 70, 000m 2 
and thus ~ 40% larger than FAST) and a FoV of at least 
fiinst = 1 deg 2 (that is more than three times larger than the 
100-beam case) is likely to come into operation during the 
first five years of FAST operation. It is interesting to com- 
pare and contrast the likely performance of such an early- 
stage SKA relative to FAST. We will assume that, for HI 
observations, the value of T sys for the 10% SKA is the same 
as for FAST (we note that it is likely to be slightly higher, 
but not appreciably so; our results can be scaled by T s ~ 2 
where appropriate). The angular resolution will, however, 
be much higher since the baselines on the SKA "core" will 
extend out to ~ 5 km and hence the synthesised beam will 
be ~ 10 arcsec. This introduces an extra complication into 
predicting the number counts since many galaxies may be 
resolved and hence use of the point source approximation 
will overestimate the number of objects detected, particu- 
larly for shallow surveys. 

In order to model this effect we have introduced an up- 
per limit in the integral in eqn. [3] which is defined as the 
mass of object at a specific redshift which is resolved by 
the telescope beam. Of course, this then underestimates the 
number of objects found since some resolved objects are al- 
ways found in any survey. However, the number which are 
missed by this procedure are likely to be small since the vast 
majority of detections are just above mass threshold. 

The number of detections expected in one day are pre- 
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Figure 10. Cosmological parameter estimation for three surveys: the SDSS (top, blue); 2dFGRS (middle, red) and the proposed 100- 
beam FAST (bottom, yellow). Contours are la and 2<r respectively representing the 68% and 95% probability estimates. Due to the 
tension between the 2dFGRS and SDSS central values, the fiducial FAST cosmology was based on WMAP3. Note the large improvement 
in constraining this parameter space thanks to the order of magnitude increase in the number counts as well as in the volume probed. 



sented in Fig. [TT] for a range of angular resolutions between 
6 and 24 arcsec as a function of iobs- Included also for com- 
parison are predictions for FAST with 19 and 100 beams. 
The first thing to notice is that the FAST curves are al- 
most straight lines which peak at the lowest possible value 
of £ ba confirming our earlier assertion that the best observ- 
ing strategy for FAST, in terms of the number of objects de- 
tected, is to choose the value of i b s so that one would cover 
the whole sky available to FAST in a denned survey time. 
The curves for the 10% SKA have a very different shape, 
following a line similar to that for the FAST case for large 
values of i b s before eventually turning over and plumetting 
to zero. This is the effect of the finite resolution, with the 
maximum being where the majority of objects found in a 
survey fill the beam exactly. 

It is clear to see that for Afwhm > 15 arcsec the 10% 
SKA would make most of its detections in the point source 
regime for t a ^ s > 100 s. A higher resolution is more likely 
and if we consider the case of 12 arcsec resolution then the 
optimum value of i b s ~ 1000 s, but in terms of the number 
of objects found per day it is only marginally more powerful 
than for FAST with hb = 100. For higher resolution still 
the optimal value of t bs increases to around 2 x 10 4 s ss 
6 hours at 6 arcsec. In this case, one would be performing a 
much deeper survey which might capable of probing power 



spectrum at much higher redshifts and constraining the dark 
energy using the Baryonic Acoustic Oscillations. It would 
certainly be very di fferent in nature to that whi ch we have 
proposed for FAST (|Abdalla and Rawlings!l2004l ). 

To summarize if the 10% stage of the SKA has relatively 
low resolution and a FoV of ~ 1 deg 2 then it will be a factor 
of a few times more powerful than FAST, in terms of the 
number of galaxies found at relatively low redshifts. But if 
it has much higher angular resolution then the survey which 
it will yield will be dominated by objects at much higher 
redshifts. If this were to be the case the FAST and 10% 
SKA would be complementary. 



7 CONCLUSIONS 

In this paper we have shown that the upcoming FAST is an 
extremely capable instrument for large-scale neutral hydro- 
gen surveys over the whole sky with the potential for large 
numbers of galaxies being discovered daily in the redshift 
range — 0.2 and tens of galaxies out to mid-redshifts of 
order 0.2 — 0.4. This would provide complementary infor- 
mation to the redshift surveys already performed at optical 
wavelengths and would be considerably better than those 
presently available (or likely to be available in the near fu- 
ture) selected using neutral hydrogen as the tracer. 
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Figure 11. 10% SKA number counts (6ct) for a variety of surveying strategies defined by tobsi with a range of proposed angular 
resolutions. The resolutions considered here are 6 arcsec (solid), 12 arcsec (dot), 18 arcsec (dash) and 24 arcsec (large-dash). The 19- 
beam (dotted long-dash) and 100-beam (dotted short-dash) surveys possible with FAST are presented for comparison. As explained in 
the text the 10% SKA will be significantly limited by the effect of resolving out galaxies, hence the trend seen here of better resolution 
demanding a survey scheme that favours deeper exposures to probe higher redshifts and observe more galaxies as point-like. 



In calculating the number of galaxies expected we have 
taken into account the effect of different angles of inclination 
for the galaxies to the observer and shown that an increase 
of ~ 30% is expected over simplistic approaches. Essentially 
the increase arises from the detection of face-on galaxies 
which are otherwise neglected. A first-order estimate has 
also been made of the effect of the evolution of the hydrogen 
mass of a galaxy by means of a toy model based on a linear 
increase in the mass of M* with redshift. Although simplistic 
this nevertheless indicates that quite conservative increases 
in the neutral hydrogen content of a galaxy with redshift 
can lead to a factor of two increase in the expected number 
counts. This would lead to reductions in errors of the power 
spectrum due to shot noise, although we have not used this 
evolving HI mass function for the cosmological parameter 
constraints of Section [5] Prior to the main FAST survey 
we have suggested a deep, month-long survey to investigate 
these evolution effects by estimating the HI mass function 
out tozs; 0.6. 

For the FAST HI galaxy survey to be a major improve- 
ment over current optical surveys it will be necessary to 
detect ~ 10 7 galaxies over ~ 20000 deg 2 , most of which will 
lie within redshifts of a few tenths. A uniformly sampled 



~ 10 7 HI galaxy survey will allow nearly an order of mag- 
nitude improvement of cosmological parameter constraints 
than currently provided by SDSS. For example the FAST 
survey will, independent of CMB observations, constrain n s 
to 7% and V to 5%. Used in conjunction with observations 
of the CMB made by Planck this could reduce the volume 
of the overall cosmological parameter space. 

We have shown, however, that to achieve this perfor- 
mance in a reasonable time (~ 2 years) a receiving sys- 
tem providing of order 100 beams is necessary. Conventional 
horn-based systems with physical limits to their packing 
density are restricted to ~ 19 beams at the FAST prime 
focus and hence the 100-beam requirement can only be met 
with close-packed phased arrays of small antenna elements; 
these would not take up much more area than the 19-beam 
horn system. The required phased array technology is ac- 
tively being developed by several groups working on tech- 
nology for the SKA and hence we can be confident that by 
the time FAST comes into operation in 2012-2013 the 100- 
beam receiver capability will be available. 

We have briefly looked ahead to the first phase of the 
SKA and compared the shallower FAST surveys with those 
from the much higher resolution interferometric surveys 
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which will come somewhat later than that from FAST. The 
FAST and early-SKA surveys are complementary with the 
higher resolution SKA surveys preferentially finding higher 
redshift objects. 
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